Regret bound
Choose
so that:
Here
is called the regret of our solution sequence
.
Regret compares to the best fixed solution in hindsight.
We typically
to be growing sublinearly in
.
- Either
are similar or changing slowly, so we can learn predict
from earlier functions.
- Or
are very different, in which case
is large, so regret bound is easy to achieve.
- Or we live somewhere in the middle.
#incomplete ****
Online regret bound
References:
- http://sbubeck.com/LecturesALL_Bubeck.pdf
- https://www.jmlr.org/papers/volume11/jaksch10a/jaksch10a.pdf
- https://arxiv.org/abs/2206.04640
- https://datascience.stackexchange.com/questions/62141/what-are-regret-bounds
- https://cs.adelaide.edu.au/~javen/talk/bounds_slide5.pdf